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1. Introduction 

1.1. Motivations 

Multiple testing is a long-established topic in statistics which has seen a surge of interest 
in the past two decades. This renewed popularity is due to a growing range of applications 
(such as bioinforniatics and medical imaging) enabled by modern computational possi- 
bilities, through which collecting, manipulating and processing massive amounts of data 
in very high dimension has become commonplace. Multiple testing is in essence a multi- 
ple decision problem: each individual test output is a yes/no (or accept /reject) decision 
about a particular question (or null hypothesis) concerning the generating distribution 
of some random observed data. 

The standard framework for multiple testing is to consider a finite family of hypothe- 
ses and associated tests. However, in many cases of interest, it is natural to interpret 
the observed data as the discretization of an underlying continuously-indexed random 
process; each decision (test) is then associated to one of the discretization points. A 
first example is that of detecting unusually frequent words in DNA sequences: a classical 
model is to consider a Poisson model for the (non-overlapping) word occurrence process 
(Robin, 2002), the observed data being interpreted as a discretized version of this pro- 
cess. A second example is given in the context of medical imaging, where the observed 
pixelized image can be interpreted as a sampled random process, and the decision to 
take is, for each pixel, whether the observed value is due to pure noise or reveals some 
relevant activity (pertaining to this setting, see in particular the work of Pcronc Paciiico 
(-t al, 2004, 2007; see also Scliwartznian ct al., 201f). 

Therefore, the present paper explores multiple testing for a (possibly) uncountably 
infinite set of hypothesis. With some abuse of language, we will refer to this as the con- 
tinuous setting and use loosely the word "continuous" in reference to sets in order to 
mean: possibly uncountably infinite. For the above examples, this corresponds to per- 
form testing for the underlying continuously-indexed process in a direct manner, without 
explicit discretization. 

1.2. Contribution and presentation of this work 

The principal contributions of the present work are the following. We first define a precise, 
but general in scope, mathematical setting for multiple testing over a continuous set of 
hypotheses, taking particular attention to specific measurability issues. Specifically, we 
focus on the extension to continuously-indexed observation (and decision) processes of 
so-called step-up multiple testing procedures, and the control of the (continuous analogue 
of) their false discovery rate (FDR), a type I error measure which has gained massive 
acceptance in the last 15 years for testing in high-throughput applications. To this end, we 
use the tools and analysis developed by (Blanchard and Roquain, 2008) (a programmatic 
sketch of the present work can be found in Section 4.4 of the latter paper). In particular, 
we extend suitably to the continuous setting the notion of positive regressively dependent 
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on a subset (PRDS) condition, which plays a crucial role in the analysis. The latter is 
a general type of dependence condition on the individual tests' p- values allowing to 
ensure FDR control. An important difference between the continuous and finite setting 
is that the continuous case precludes the possibility of independent p- values, which is the 
simplest reference setting considered in the finite case, so that a more general assumption 
on dependence structure is necessary (on this point, see the discussion at the end of 
Section 2.2). 

We have tried as much as possible to make this work self-contained, and accessible 
to readers having little background knowledge in multiple testing. We begin in the next 
section with an extended informal discussion of the framework considered in this paper 
in relation to existing literature on nonparametric testing. Sections 2 and 3 of the paper 
introduce the necessary notions for multiple testing with a an angle towards stochastic 
processes, and some specific examples for the introduced setting. The main result is ex- 
posed in Section 4, followed by its applications to the examples introduced in Section 2. 
The proof for the main theorem is found in Section 5. Extensions and discussions come 
in Section 6, while some technical results are deferred to Appendix A and to the supple- 
mentary material (Blancliaixl et al., 2011). Throughout the paper, the numbering of the 
sections and results of this supplement are preceded by "S-" for clarity (by writing, e.g.. 
Section S-1). 

1.3. Relationship to nonparametric statistics 

Multiple testing over a continuous set of hypotheses has natural ties with nonparametric 
statistics. In the present section, we briefly discuss this link and introduce informally our 
goals. We do not attempt a comprehensive survey of the very broad field of nonparametric 
testing, but rather emphasize some key specificities of the point of view adopted in the 
current work. 

Nonparametric testing. In order to be more concrete, consider the classical white 
noise model dZt = f{t)dt + adBt, where B is a Wiener process, t G [0, 1] (this model will 
be more formally considered as Example 2.3), with unknown drift function f £ J- , where 
J" is some a priori smoothness class. The problem of testing various hypotheses about / 
against nonparametric alternatives has received considerable attention since the seminal 
work of lugster (1982, 1993). The most common goal is to test one single qualitative 
null hypothesis, for instance: / is identically zero; / is nonnegative; / is monotone; / 
is convex. For concreteness, consider the first possibility, denoted := {/ = 0}. A 
common strategy to approach this goal is to consider a collection of test statistics of 
the form := J ip{t)dZt for some well chosen family ^ of test functions ip (normalized 
so that ||V'||2 — !)■ For each individual test function ip, we have ~ J\f{J f^,a^), 
so that this test statistic can be used for a Gauss test of the "local" null hypothesis 
Hy.= {feT;Jf^ = 0}DHl^ 

Intuitively, each statistic will have power against a certain type of alternative. Tak- 
ing into account simultaneously all test statistics over ^ G now constitutes a mul- 
tiple test problem. The simplest way to combine these is to consider Txj, := supi^^^, \T^\ 
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and reject whenever exceeds a certain quantile r of its distribution under the nuh 
H^. In multiple testing parlance, this is interpreted as testing over the hypothesis family 
{H^)^iz^, with weak control of the family-wise error rate (FWER), which is defined as the 
probability to reject falsely one or more of the considered null hypotheses. Here a local 
hypothesis is interpreted as rejected if |T^| exceeds r. The qualifier "weak" refers to 
the fact that this probability is controlled at a nominal level only under the global null 
rather than for all f E J-. 
This connection has been noted and discussed in the literature; for instance, Diimbgcn 
and Spokoiny (2001), using the above type construction and multiple testing interpre- 
tation, observe that "whenever the null hypothesis is rejected, we have some infor- 
mation about where this violation occurs". To formalize this idea more precisely, define 
the rejection set Rr ■— {ip & ■ \T^\ > t} as the set of (indices of) hypotheses from the 
family (if°)^g^ which are deemed false. In order for Rr to be interpretable as intended, 
the threshold r should now be chosen so that for any f G T, the probability that the 
rejection set has non-empty intersection with (indices of) the set of hypotheses satis- 
fied by /, I")/) G : / G ^°|; is less than a nominal level. This is called strong control 
of the FWER. This point of view appears to have been only seldom considered explic- 
itly in nonparametric testing literature; a recent example is the work of Scimiidt-Hicber 
et al. (2011) (in the framework of density deconvolution) , wherein each individual null 
hypothesis H'^ has a qualitative interpretation in terms of / being monotone on some 
subinterval. 

Type I error criteria. For adequate (weak or strong) control of the FWER in the 
example above, it is clear that a stochastic control of the deviations of the supremum 
statistic T^s, is necessary; this, in turn, depends on the complexity of the set ^' (typically 
measured through L2 metric entropy). As a consequence, FWER-controUing procedures 
will be more conservative as the complexity of the family of the underlying test increases; 
for instance, in a d-dimensional version of the above example, or if we simply consider a 
longer observation interval, the threshold r would have to be more conservative (larger) 
to obtain control of the FWER at the same level. 

Motivated by the multiple testing point of view, we consider alternative, less stringent 
type I error criteria. Let us still consider the white noise example, in a slightly modified 
setup where we are specifically interested in the family of null hypotheses (-ff^^) ^^j^ 

with := {/ G J-"; / (t) = 0}. Each individual null hypothesis can be tested using a 
statistic as defined above; more precisely, this statistic can test for the null H^^ := 
{f £ J- : J fipt G [— 15, 5]} {tpt and 6 being chosen adequately so that C H^^ holds). 
Define similarly to earlier Rr := {t G [0, 1] : |r^J > r}, and introduce Fr{f) := Rt n 
{t G [0, 1] : / G iJ"} the set of incorrectly rejected hypotheses when the true drift is 
/. To reiterate, FWER control is the requirement that P/ [Fr{f) 7^ 0] is bounded at a 
nominal level for all f d J-. Consider now the weaker requirement that the average size of 
Fr{f) (measured through its Lebesgue measure, denoted |.|) is bounded at some nominal 
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level. We observe via an application of Fubini's theorem: 

% [\Fr[m = [|T^J > r] 1{/ e H^,)dt- 

JQ 

the averaged false reject size is simply the individual test error level, integrated over 
the null hypothesis family. It is the continuous analogue of the so-called per- comparison 
error rate (PCER) in multiple testing. To control this quantity, clearly no multiple testing 
correction is necessary, and it is sufficient to choose r so that each individual test has 
Type I error controlled at the desired level. This criterion, however, is not very useful in 
practice: only if the volume the rejection set is much larger than the nominal expected 
volume of errors, can we have some trust that the rejection set contains interesting 
information. To address this issue, introduce the average volume proportion of falsely 
rejected hypotheses: 

FDR(i?„ /) := Ey [FDP(i?,, /)] , where FDP(i?,, /) = ^^j^, (1) 

\Kr\ 

with the convention 2 = i. xhe acronyms FDP and FDR stand for false discovery 
proportion and rate, repectively, and the above are the continuous generalization of cor- 
responding criteria introduced for finite hypothesis spaces by Bcnjaniini and Hocliljcrg 
(1995) for a finite number of hypotheses, and which have gained widespread acceptance 
since. Controlling the FDR is a more difficult task than for the PCER, because of the 
random denominator inside the expectation, and is the main aim of this paper. 

As we shall see, a crucial difference of the FDR criterion with respect to the FWER 
is that, if a family of tests of the individual hypotheses is known, there exist relatively 
generic procedures called step-up to combine individual tests into a FDR-controlled mul- 
tiple testing, by finding an adequate, data-dependent rejection threshold. In particular, 
these procedures do not depend on an intrinsic complexity measure of the initial family, 
nor of the control of deviations of suprema of statistics. 

To conclude these considerations, it is worth noting that a FDR-controlled proce- 
dure can also be used for the goal of testing the single "global null" hypothesis = 
ntg[o.i]i?°, as in the opening discussion. Namely, if Rt is a procedure whose FDR is 
controlled at level a, we can reject the global null hypothesis H'^ whenever > 0. 
Under the global null, the FDP takes only the values or 1 and precisely coincides with 
> 0}; thus, its expectation is the probability of type I error for testing the global 
null this way, and is bounded by a. That this can be achieved by a generic procedure 
without explicitly considering the deviations of a supremum process can seem surprising 
at first. Since the focus of this paper is centered on the multiple hypothesis testing point 
of view, we will not elaborate on this issue further, although a power comparison with 
the "standard" approaches would certainly be of interest. 

Related work. The continuous FDR criterion using volume ratios was introduced 
before by Pcrono Pacifico ct al. (2004, 2007) to test nonnegativity of the mean of a 
Gaussian field. In that work, the authors follow a two-step approach where the first 
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step consists in a FWER-controlled multiple testing based on suprema statistics, as 
delineated above. This is then used to define an upper bound on the FDP holding with 
large probability, following the principle of so-called augmentation procedures (van der 
Laan et al., 2007) for multiple tests. An advantage of this approach is that a control of 
the FDP holding with large probability is obtained (which is stronger than a bound on 
the FDR, its expectation); on the other hand, the authors observe that since the first step 
is inherently based on FWER control, it is more conservative than a step-up procedure. 
In the present work, we focus on step-up procedures, for which a probabilistic control of 
the deviations of suprema statistics is not needed; this allows us also to address directly 
a broader range of applications. 

2. Setting 

2.1. Multiple testing: mathematical framework 

Let X be a random variable defined from a measurable space (Q,^) to some observation 
space {X,X). We assume that there is a family of probability distributions on (^2,3^) that 
induces a subset V of probability distributions on {X, X), which is called the model. The 
distribution of X on {X, X) is denoted by P; for each P E V there exists a distribution 
on (f2, ^) for which X ^ P; it is referred to as Pjc ~p or simply by P when unambiguous. 
The corresponding expectation operator is denoted Ex~p or E for short. 

We consider a general multiple testing problem for P, defined as follows. Let H denote 
an index space for (null) hypotheses. To each h EH is associated a known subset Hh C 
V of probability measures on {X,X). Multiple hypothesis testing consists in taking a 
decision, based on a single realization of the variable X, of whether for each h E H 
it holds or not that P e Hk (which is read "P satisfies Hh\ or "Hh is true"). We 
denote by 'Ho{P) :— {h E V. : P satisfies Hh} the set of true null hypotheses, and by 
its complementary 'Hi(P) := "H \ 'Ho(P) the set of false nulls. These sets are of course 
unknown because they depend on the unknown distribution P. For short, we will write 
sometimes Ho and Hi instead olHo{P) and Hi{P), respectively. 

As an illustration, if we observe a continuous Gaussian process X = {Xfi)helo,i]'^ with 
a continuous mean function : h E [0, l]"* i— > /i(i) EXt, then P is the distribution of 
this process, {X,X) is the Wiener space and V is the set of distributions generated by 
continuous Gaussian processes having a continuous mean function. Typically, H = [0, 1]'^ 
and, for any h, we choose H/^ equal to the set of distributions in P for which the mean 
function ^ satisfies /Lt(/i) < 0. This is usually denoted Hh'- "ti{h) < 0". Finally, the set 
^o(P) — {h & [0, 1]'' : ii{h) < 0} corresponds to the true null hypotheses. Several other 
examples are provided below in Section 2.4. 

Next, for a more formal definition of a multiple testing procedure, we first assume the 
following: 

The index space H is endowed with a cr-algebra Sj and for all P £ P, 
the set Ho{P) of true nulls is assumed to be measurable, that is, Ho{P) E Sj. 

(Al) 
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Definition 2.1 (Multiple testing procedure). Let X : (ri,5^) — > be a random 

variable, V a model of distributions of X , andJi an index set of null hypotheses. Assume 
(Al) holds. A multiple testing procedure on H is a function R : X[fl) G X ^ Sj such 
that the set 

{{uj,h) enxn-. heRiX{uj))} 

is a ^ <Si ^-measurable set; or in other terms, that the process {l{h G R{X)})ii^f{ is a 
measurable process. 

The fact that R need only be defined on the image X{n), rather than on the full space 
A", is a technical detail necessary for later coherence; this introduces no restriction since 
R will only be ever applied to possible observed values of X. 

A multiple testing procedure R is interpreted as follows: based on the observation 
X = X{uj), R{x) is the set of null hypotheses that are deemed to be false, also called set 
of rejected hypotheses. The set HoiP) H R{x) formed of true null hypotheses that are 
rejected in error is called the set of type I errors. Similarly, the set 'Hi(P) H R'^{x) is that 
of type II errors. 

2.2. The p-value functional and process 

We will consider a very common framework for multiple testing, where the decision for 
each null hypothesis H^, h ^H, is taken based on a scalar statistic Ph{x) € [0, 1] called 
a p-value. The characteristic property of a p-value statistic is that if the generating 
distribution P is such that the corresponding null hypothesis is true (i.e. h € 'Ho{P)), 
then the random variable PhiX) should be stochastically lower bounded by a uniform 
random variable. Conversely, this statistic is generally constructed in such a way that if 
the null hypothesis Hh is false, its distribution will be more concentrated towards the 
value 0. Therefore, a p- value close to is interpreted as evidence from the data against 
the validity of the null hypothesis, and one will want to reject hypotheses having lower 
p-values. Informally speaking, based on observation x, the construction of a multiple 
testing procedure generally proceeds as follows: 

(i) compute the p-value Ph{x) for each individual null index h £%. 

(ii) determine a threshold th{x) for each h £ Ji, depending on the whole family 

{Ph{x))he-H- 

(iii) put R{x) ^{hen-. Ph{x) < th{x)}. 

To summarize, the rejection set consists of hypotheses whose p-values are lower than a 
certain threshold, this threshold being itself random, depending on the observation a; 
and possibly depending on h. This will be elaborated in more detail in Section 3.2, in 
particular how the threshold function th{x) is chosen. For now, we focus on properly 
defining the p-value functional itself, the associated process, and the assumptions we 
make on them. 

Formally, we define the p-value functional as a mapping p : X — > [0, 1]^, or equiva- 
lently as a collection of functions p = {ph{x))h£n, each of the functions ph : X [0, 1], 
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h £ %, being considered as a scalar statistic that can be computed from the observed 
data X € X. 

We will consider correspondingly the random p- values w G i— > and p- value 

process w € i— > p(X(aj)). With some notation overload, we will sometimes drop the 
dependence on X and use the notation and p to also denote the random variables 
Ph{X) and p(^) (the meaning - function of x, or random variable on $7 - should be clear 
from the context). 

We shall make the following assumptions on the p-value process: 

• Joint measurability over f2 x "H: we assume that the random process {ph{X))h^'H 
is a measurable process, that is: 

(w, h) e {VL X U,^ ® S)) ^ ph{X{uj)) e [0, 1] is (jointly) measurable. (A2) 

• For any P d V, the marginal distributions of the p- values corresponding to true 
nulls are stochastically lower bounded by a uniform random variable on [0, 1]: 

yh e Ho{P), yu e [0, 1], Fx^p{Ph{X) <u)<u. (A3) 

(The distribution of ph if h lies in 'Hi{P) can be arbitrary). 

Condition (A2) is specific to the continuous setting considered here and will be dis- 
cussed in more detail in the next section. Condition (A3) is the standard characterization 
of a single p- value statistic in classical (single or multiple) hypothesis testing. In general, 
an arbitrary scalar statistic used to take the rejection decision on hypothesis Hh can be 
monotonically normalized into a p- value as follows: assume Suix) is a scalar test statistic, 
then 

Ph{x) = sup Fh^p{Sh{x)) 

is a p- value in the sense of (A3), where Fh.p{t) = Pxr^p{Sh{X) > t) (and where the 
supremum is assumed to maintain the measurability in x, for any fixed h). If the scalar 
statistic Sh{x) is constructed so that it tends to be stochastically larger when hypothesis 
Hh is false, the corresponding p-value indeed has the desirable property that it is more 
concentrated towards in this case. Such test statistics abound in the (single) testing 
literature, and a few examples will be given below. 

2.3. Discussion on measurability assumptions 

Since the focus of the present work is to be able to deal with uncountable spaces of 
hypotheses H, we have to be somewhat careful with corresponding measurability as- 
sumptions over "H (a problem that does not arise when "H is finite or countable). The 
main assumption needed in this regard in order to state properly the results to come is 
the joint measurability assumption appearing in either Definition 2.1 (for the multiple 
testing procedure) or in (A2) (for the p- value process), both of which are specific to 
the uncountable setting. Essentially, joint measurability will be necessary in order to use 

imsart-bj ver. 2007/09/18 file: BDR_arXiv_verslon2.tex date: September 5, 2012 



Testing over a continuum with FDR control 



9 



Fubini's theorem on the space {p. x "HjS^i^i^), and have the expectation operator w.r.t. 
uj and the integral operator over % commute. 

If % has at most countable cardinality, and is endowed with the trivial cr-field compris- 
ing all subsets of %, then (A2) is automatically satisfied whenever all individual p- value 
functions ph : X ^ [0, 1], /i G "H, are separately measurable, which is the standard setting 
in multiple testing. 

If % is uncountable, a sufficient condition ensuring (A2) is the joint measurability of 
the p-value functional, 

(x,h) e [X X T-L,X ® S)) ^ ph{x) G [0,1] is (jointly) measurable, (A2') 

which implies (A2) by composition. Unfortunately, (A2') might not always hold. To see 
this, consider the following canonical example. Assume the observation takes the form 
of a stochastic process indexed by the hypothesis space itself, X = {Xh,h G H}. In 
this case, the observation space X is included in M^. Furthermore, assume the p- value 
function Ph{x) is given by a fixed measurable mapping ip of the value of x at point 
/i, i.e. Ph{x) = ipi^h), V/i G "H. In this case assumption (A2') boils down to the joint 
measurability of the evaluation mapping {x,h) G X xH !-> x/j. Whether this holds 
depends on the nature of the space X. We give some classical examples in the next 
section where the assumption holds; for example, it is true if X is the Wiener space. 

However, the joint measurability of the evaluation mapping does not hold if X is 
taken to be the product space endowed with the canonical product a-field (indeed, 
this would imply that any x G M^, i.e., any function from H into M, is measurable). 
The more general assumption (A2) may still hold, though, but it generally requires 
some additional regularity or structural assumptions on the paths of the process X. In 
particular, in the above example ii X = {X^^h G H} is a stochastic process having a 
(jointly) measurable modification (and more generally for other examples, if there exists 
a modification of X such that (A2) is satisfied), we will always assume that we observe 
such a modification, so that assumption (A2) holds. 

We have gathered in Section S-1 of the supplementary material Blanchard ct al. (2011) 
some auxiliary (mostly classical) results related to the existence and properties of such 
modifications. Lemma S-1. 2 shows that such a (jointly) measurable modification exists 
as soon as the process is continuous in probability. The latter is not an iff condition, but 
is certainly much weaker than having continuous paths. 

On the other hand, it is important to observe here that a jointly measurable modifi- 
cation of X, or, for that matter, of the p value process, might not exist. Lemma S-1.1 
reproduces a classical argument showing that for H = [0,1], assumption (A2) is vio- 
lated for any modification of a mutually independent p-value process. Therefore, for an 
uncountable space of hypotheses H, assumption (A2) precludes the possibility that the 
p- values {ph, h G H} are mutually independent. This contrasts strongly with the situa- 
tion of a finite hypothesis set H, where mutual independence of the p- values is generally 
considered the reference case. 

A final issue is to which extent the results exposed in the remainder of this work 
depend on the (jointly) measurable modification chosen for the underlying stochastic 
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process. Lemma S-1.4 elucidates this issue by showing that this is not the case, because 
the FDR (the main measure of type 1 error, which will be formally defined in Section 3.1 
) is identical for two such modifications. 

2.4. Examples 

To illustrate the above generic setting, let us consider the following examples. 

Example 2.2 (Testing the mean of a process). Assume that we observe the realization 
of a real-valued process X = (-'^t)te[o,i]'^ with an unknown (measurable) mean function 
H : t E [0,1]'' i-> fJ.{t) EXt. We take H = [0,1]'^ and want to test simultaneously 
for each t E [0,1]"* the null hypothesis Ht : "/i(t) < 0". Assume that for each t the 
marginal distribution of {Xt — fJ-{t)) is known, does not depend on t and has upper-tail 
function G (for instance, X is a Gaussian process with marginals Xt ^ A/'(/i(t), 1)). We 
correspondingly define the p-value process Vi E [0,1]'', Pt{X) — G{Xt), which satisfies 
(A3). Next, the measurability assumption (A2) follows from a regularity assumption on 
X: 

• if we assume that the process X has continuous paths, X : uj i-^ {Xt{Lj))t can be 
seen as taking values in the Wiener space X = C[o,i]d = C([0, 1]'',M) of continuous 
functions from [0, 1]'' to M. (In this case, the Borel cr-field corresponding to the 
supremum norm topology on C[o_i]d is the trace of the product cr-ficld on Cjo^ijd, 
and X is measurable iff all its coordinate projections are.) Furthermore, the p- value 
function can be written as 

(x, t) E C[o,i]. X [0, l]'^ ^ pt{x) = G{x{t)) E [0, 1]. 

The evaluation functional {x^t) E Cjo^ijd x [0,1]'' ^ x{t) is jointly measurable 
because it is continuous, thus Ptix) is jointly measurable by composition and (A2') 
holds, hence also (A2). 

• if d = 1 and the process X is cadlag , the random variable X can be seen as taking 
values in the Skorohod space X = 1) := I?([0, 1],M) of cadlag functions from [0, 1] 
to M. In this case, the Borel cr-field generated by the Skorohod topology is also 
the trace of the product cr-field on V (see, e.g.. Theorem 14.5 p. 121 of Billingslcy, 
1999). Moreover, the evaluation functional {x,t) H> x{t) is jointly measurable, as 
for any cadlag funtion x, it is the pointwise limit of the jointly measurable functions 
C„: {x, t) ^ Cn(a;, t) := YX^i x{k2''')l{{k - 1)2"" < t < k2-"} + x{l)l{t - 1} , 
therefore (A2') is fulfilled by composition, hence also (A2). 

• assume that AT is a Gaussian process defined on the space X = endowed with 
the canonical product cr-field, and with a covariance function E(t,t') such that S 
is continuous on all points (t, t) of the diagonal and takes a contant (known) value 
cr^ on those points. 

This assumption is not sufficient to ensure that X has a continuous version, but 
it ensures that {Xt) is continuous in and hence in probability; Lemma S-1.2 
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then states that X has a modification such that the evaluation functional is jointly 
measurable. Assuming that such a jointly measurable modification is observed, we 
deduce that (A2) holds for the associated p-value process. 

Example 2.3 (Testing the signal in a Gaussian white noise model). Let us consider 
the Gaussian white noise model dZt — f{t)dt + crdBt, t G [0,1], where i? is a Wiener 
process on [0,1] and / £ C([0, 1]) is a continuous signal function. For simplicity, the 
standard deviation a is assumed to be equal to 1. Equivalently, we assume that we can 
observe the stochastic integral of Zt against any test function in i^([0, 1]), that is, that 
we observe the Gaussian process (Arg)gg2.2([o,i]) defined by 

Xg:= I 9{t)f(t)dt+ f g{t)dBt, g e L^[0,l]). 







Formally, the observation space is the whole space X = M^^([°'^l\ endowed with the 
product cr-field. However, in the sequel, we will use the observation of the process X only 
against a "small" subspace of functions of i^([0, 1]). 

Let us consider H = [0, 1] and the problem of testing for each t E H, the null Ht : 
"/(*) < 0" (signal nonpositive) . We can build p- values based upon a kernel estimator in 
the following way. Consider a kernel function K G L^(K), assumed positive on [—1, 1] and 
zero elsewhere, and denote by Kt € i^([0, 1]) the function Kt{s) := K{{t — s)/t]), where 
< 77 < 1 is a bandwidth to be chosen. Let us consider the process Xt := XKt,t G [0, 1]. 
From Lemma S-1.3, X has a modification which is jointly measurable in (a;,t). Clearly, 
this implies that there exists a modification of the original process X such that X is 
jointly measurable in (w, t), and we assume that we observe such a modification. For any 
tG [0,1], letting c;f,t := K{{t-s)/T])ds > andvK.t J^^ K^{{t-s)/r])ds > c^.^ > 0, 
we can consider the following standard estimate of f{t): 

/•I /+ \ /■! / 4. 



Assume that there is a known 6t^n > such that for any t with f{t) < 0, we have the 
upper-bound 



1 ^t-s 



^Ut)^cj/^tJ^ K\^—jf{s)ds<St,,. (3) 

For instance, this holds if we can assume a priori knowledge on the regularity of /, of 
the form sups.|^_j|<^ l/(s) — < ^t,??- Then, the statistics {f-q{t))t can be transformed 
into a p- value process in the following way: 



/cK,i 
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where := V{W > w), W ^ A/'(0, 1), is the upper tail distribution of a standard 

Gaussian distribution. The p- value process (4) satisfies (A3), because for any t with 
f{t) < and any u e [0, 1], 

ViptiX) <u)= V{f^{t) - St,^ > v]I^JcK^t'^'\u)) 

< ncKAMt) - E7r,w)/«K? > ^~\u)) 

= u, 

because Kt{s)dBs ^ A/'(0, v^.t)- Moreover, thep-value process (4) satisfies (A2), since 
we assumed {XKt)t G [0, 1] to be jointly measurable in (w, t). 

Example 2.4 (Testing the c.d.f.)- Let X = [Xi, . . .,X„^) G A" = be a m-uple of 
i.i.d. real random variables of common continuous c.d.f. F. For % = I a,n interval of K 
and a given benchmark c.d.f. -Fq, we aim to test simultaneously for alH G / the null : 
" F{t) < Fo{ty\ The individual hypothesis Ht may be tested using the p-value 

PtiX)^Gt{m¥miX,t)), (5) 

where ¥„i{X, t) = mT^ TllLi ^ is the empirical c.d.f. of Xi, . . . , X„i and where 

Gt{k) — V[Zt> fc], Zt ^ B{m, Fo{t)), is the upper-tail function of a binomial distribution 
of parameter (m, Fo(i)). The conditions (A2) and (A3) are both clearly satisfied. 

Figure 1 provides a realization of the p-value process (5) when testing for all t £ [0, 1] 
the null Ht : "F{t) < t" when F comes from a mixture of beta distributions. The 
correct/erroneous rejections are also pictured for the simple procedure R{X) = {t G 
[0,1] : pt{X)<OA}. 

Example 2.5 (Testing the intensity of a Poisson process) . Assume we observe {Nt)t£[o,i] € 
X = i:)([0,l],M) a Poisson process with intensity A : [0,1] -J> K+ G L'^{dA), where 
A denotes the Lebesgue measure on [0,1]. For each t G [0,1], we aim to test Ht : 
"A(t) < Ao(i)" where Ao(-) > is a given benchmark intensity. Assume that for a 
given bandwidth 77 G (0,1], there is a known upper bound 6t^r] for JI^^^^^^^ X{s)ds 
that holds true for any t such that A(t) < Xo{t). For instance, we can choose St.jj = 
((i + 77) A 1 — (t — 77) V Q){Xo{t) + supj,.|(_^|<,,^ |A(t) — A(s)l) (assuming knowledge on the 
regularity of A is available a priori). For any t G [0, 1], the variable N(^t_^^^/^i — iV(t_,,)vo 
follows a Poisson variable of parameter JI^^^^-I^q X{s)ds. Since the latter parameter is 
smaller than (5(_^ as soon as X{t) < Ao(t), the following p- value process satisfies (A3): 

Pt{X) = Gt{N^t+,,)M - ^(t-„)vo), (6) 

where for any fc G N, Gt{k) denotes P[Z > k] for Z a Poisson distribution of parameter 
St^rf- Moreover, the p- value process fulfills condition (A2'), because (Nt) is a cadlag 
process, so that arguments similar to those of Example 2.2 apply. Thus (A2) also holds. 
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Figure 1. Plot of a realization of the p-valuc process as defined in (5) for the c.d.f. testing, together 
with Fq and F, for Fo(t) = t and F{t) = 0.5Fi(t) + 0.5F2(i), where Fi (resp. F2) is the c.d.f. of a beta 
distribution of parameter (0.5, 1.5) (resp. (1.5, 0.5)). The region where the null hypothesis "F{t) < Fo{t)" 
is true is depicted in grey color. The crosses correspond to the elements of {Xi, 1 < i < m}; m = 50. 
The correct/erroneous rejections refer to the procedure R{X) = {t S [0,1] : pt{X) < 0.4} using the 
threshold 0.4. 



3. Main concepts and tools 
3.1. False discovery rate 

Following the usual philosophy of hypothesis testing, one wants to ensure some control 
over type I errors commited by the procedure. As discussed in Section 1.3, in the present 
work we focus on a generalization to a continuum of hypotheses of the False Discovery 
Rate (FDR). For a finite number of null hypotheses, the FDR, as introduced by Ben- 
jainini and Hochl)erg (1995) (see also Scegcr, 19G8), is defined as the average proportion 
of type I errors in the set of all rejected hypotheses. To extend this definition to a possibly 
uncountable space, following Pcronc Pacifico ct al. (2004, 2007), we quantify this propor- 
tion by a volume ratio, defined with respect to a finite measure A on (T-ljS^) (the usual 
definition over a finite space is recovered by taking A equal to the counting measure). 

Definition 3.1 (False discovery proportion, false discovery rate) . Let A be a finite pos- 
itive measure on ('H,io). Let R be a multiple testing procedure on H . The false discovery 

imsart-bj ver. 2007/09/18 file: BDR_arXiv_verslon2.tex date: September 5, 2012 



14 



rate (FDR) of R is defined as the average of the the false discovery proportion (FDP): 
yPeV, yxe X{n) , FDPiRi.),P) ^^^^|^^l^j^l{A(i?(.)) > 0}, (7) 

and 

VP € P , FDR{R, P) := Ex^p [FDP{R(X), P)] . (8) 

The indicator function in (7) means that the ratio is taken equal to zero whenever 
the denominator is zero. Observe that, due to the joint measurabihty assumption in 
definition 2.1 of a muhiple testing procedure, both of the above quantities are well- 
defined (the FDP is only formally defined over the image of Q, through X since only 
on this set is the measurabihty of R{x) guaranteed by the definition. In particular, it is 
defined for P-almost all x £ A"). 

As illustration, in the particular realization of the p- value process pictured in Figure 1, 
if we denote by "Red" (resp. "Green") the length of the interval corresponding to the 
projection of the red (resp. green) part of the p-value process on the X-axis, the FDP 
of the procedure R{X) = G [0, 1] : pt{X) < 0.4} is Red/(Red + Green). A similar 
interpretation for the FDP holds in Figure 2. 

Finding a procedure R with a FDR smaller than or equal to a has the following 
interpretation: on average, the volume proportion of type I errors among the rejected 
hypotheses is smaller than a. This means that the procedure is allowed to reject in error 
some true nulls but in a small (average) proportion among the rejections. For a pre- 
specified level a, the goal is then to determine multiple testing procedures R such that 
for any P G V, it holds that FDR(i?, P) < a. (In fact, the statement need only hold for 
P eVn U/igH Hh, since outside of this set 'Ho(P) = and the FDR is 0.) The rest of 
the paper will concentrate on establishing sufficient conditions under which the FDR is 
controlled at a fixed level a. Under this constraint, in order to get a procedure with good 
power properties (that is, low type II error), it is, generally speaking, desirable that R 
rejects as many nulls as possible, that is, has volume A(i?) as large as possible. 

3.2. Step-up procedures 

In what follows, we will focus on a particular form of multiple testing procedures which 
can be written as function of the p- value family p{x) = [phixYjheH- 

First, we define a parametrized family of possible rejection sets having the following 
form: for a given threshold function IS. : (h,r) £ % y. h- > M+, we define for any r > 
the sub-level set 

\/x£X, LAix,r)~{heH : Phix)<Aih,r)}(Z-H. (9) 

For short, we sometimes write L/^{r) instead of La (2;, r) when unambiguous. We will more 
particularly focus on threshold functions A of the product form A{h,r) — aTr{h)f3{r), 
where a € (0, 1) is a positive scalar (level), tt :% ^ IR+ is measurable [weight function), 
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and /? : — > is nondecreasing and right-continuous {shape function). Clearly this 
decomposition is not unique, but will be practical for the formulation of the main result. 

Given a threshold function A of the above form, we will be interested in a particular, 
data-dependent choice of the parameter r determining the rejection set, called step-up 
procedure. 

Definition 3.2 (Step-up procedure). Let A{h,r) — a7r{h)f3{r) a threshold function 
with a € (0,1); tt : H — > IR+ measurable and /? : nondecreasing and right- 

continuous. Then the step-up multiple testing procedure R on (H,A) associated to A , is 
defined by 

yx e X{n) , R{x) = L/s,{x,?(x)) , where ?{x) := max{r > : K{L/:^{x,r)) > r} . (10) 

Note that r above is well-defined: first, since x G X{n) and from assumption (A2), 
the function h i— Ph{x) — a7r(/i)/?(r) is measurable; thus L/^{x,r) is a measurable set 
of "H, which in turn implies that A(L/^{x,r)) is well-defined. Secondly, the supremum of 
{r > : A{LA{x,r)) > r} exists because r = belongs to this set and M — A('H) is an 
upper bound. Third, this supremum is a maximum because the function r i— > A(L/^{x, r)) 
is nondecreasing (right-continuity is not needed for this). 

We should ensure in Definition 3.2 that a step- up procedure satisfies the measurability 
requirements of Definition 2.1. This is proved separately in Section 5.2. In that section, 
we also check that the equality A{LA{x,r{x))) = r{x) always holds. Hence, r{x) is the 
largest intersection point between the function r i— )• A{La(x, r)) giving the volume of the 
candidate rejection sets as a function of r, and the identity line r i— )• r. 

To give some basic intuition behind the principle of a step-up procedure, consider for 
simplicity that tt is a constant function, so that the family defined by (9) are ordinary sub- 
level sets of the p- value family. The goal is to find a suitable common rejection threshold t 
giving rise to rejection set Rf. Assume also without loss of generality that A('H) = 1. Now 
consider the following heuristic. If the threshold t is deterministic, any p- value associated 
to a true null hypothesis, being stochastically lower bounded by a uniform variable, has 
probability less than t of being rejected in error. Thus, we expect on average a volume 
tA{T-La) < t oi erroneously rejected null hypotheses. If we therefore use t as a rough upper 
bound of the numerator in the definition (7) of the FDP or FDR, and we want the latter 
to be less than a, we obtain the constraint t/A(Rt) < a, or equivalently A{Rt) > a^^t. 
Choosing the largest t satisfying this heuristic constraint is equivalent to the step-up 
procedure wherein /3(m) = u. The choice of a different shape function with /3(u) < u can 
be interpreted roughly as a pessimistic discount to compensate for various inaccuracies 
in the above heuristic argument (in particular the fact that the obtained threshold is 
really a random quantity). 

In the case where H is finite and A is the counting measure, it can be seen that the 
above definition recovers the usual notion of step- up procedures (see, e.g., Blancliard 
and Roquain, 2008); in particular, the linear shape function (3{u) = u gives rise to the 
celebrated linear step-up procedure of Benjaniini and Hochbcrg (1995). 



imsart-bj ver. 2007/09/18 file: BDR_arXiv_verslon2.tex date: September 5, 2012 



16 



3.3. PRDS conditions 

To ensure control of the FDR criterion, an important role is played by structural as- 
sumptions on the dependence of the p-values. While the case of independent p-values is 
considered as the reference setting in the case where 'H is finite, we recall that for an un- 
countable set T-L, we cannot assume mutual independence of the p- values since this would 
contradict our measurability assumptions (see concluding discussion of Section 2.3). 

We will consider two different situations in our main result: first, if the dependence 
of the p- values can be totally arbitrary, and secondly, if a form of positive dependence is 
assumed. This is the latter condition which we define more precisely now. We consider 
a generalization to the case of infinite, possibly uncountable space T-L, of the notion 
of positive regression dependence on each one from a subset (PRDS) introduced by 
Benjamini and Yekutieli (2001) in the case of a finite set of hypotheses. 

For any finite set I, a subset D C [0, 1]"^ is called nondecreasing if for all z, z' e [0, l]-^ 
such that z < z' (i.e. V/i € I, z/j < z^), we have z £ D ^ z' ^ D . 

Definition 3.3. (PRDS conditions for a finite p-value family) Assume % to he finite. 
For %' a subset ofH , the p-value family p{X) = (ph{X))h<£H is said to he weak PRDS 
on T-C for the distribution P, if for any h £ %' , for any measurable nondecreasing set 
D in [0,1]^, the function u € [0,1] P(p(X) G D \ ph{X) < u) is nondecreasing 
on {u G [0,1] : V{ph{X) < u) > 0} ; it is said to be strong PRDS if the function 
u I— >■ P(p(X) G D I ph{X) = u) is nondecreasing. 

To be completely rigorous, observe that the conditional probability with respect to the 
event {ph{X) < u} is defined pointwise unequivocally whenever this event has positive 
probability, using a ratio of probabilities; while the conditional probability with respect 
to Ph{X) = u can only be defined via conditional expectation, and is therefore only 
defined up to a p;i(X)-negligible set. Hence, in the definition of strong PRDS, strictly 
speaking, we only require that the conditional probability coincides ph{X)-a.s. with a 
nondecreasing function. 

Definition 3.4. (Finite dimensional PRDS conditions for a p-value process) For %' 
a subset of H , the p-value process p{X) = (ph[X))hi^H is said to be finite dimensional 
weak PRDS on T-L' (resp. finite dimensional strong PRDS on %' ) for the distribution P, 
if for any finite subset S ofH, the finite p-value family psiX) — (j>h{X))h(£-HnS is weak 
PRDS onT-L' r\S (resp. strong PRDS onT-C C]S) for the distribution P. 

While the finite dimensional weak PRDS property will be sufficient to state our main 
result, the strong PRDS property is sometimes easier to handle. Hence, it is important 
to note that the finite dimensional strong PRDS property implies the finite dimensional 
weak PRDS property, as we establish in Lemma S-2.2, by using a standard argument 
pertaining to classical multiple testing theory for a finite set of hypotheses. 

Finally, Benjamini and Yekutieli (2001) (Section 3.1 therein) proved that the p-value 
family corresponding to a finite Gaussian random vector are (finite) strong PRDS as 
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soon as all the coefficient of the covariance matrix are non-negative. This equivalently 
proves the following result: 

Lemma 3.5. Let p(X) — [ph{X))he'H be a p-value process of the form ph{X) — G{Xh), 
h € H, where X = {Xh)he'H is a Gaussian process and where G is continuous decreasing 
from R to [0, 1] . Assume that the covariance function "E of X satisfies 

yh,h' e n, j:{h,h') > o. (ii) 

Then the p-value process is finite dimensional strong PRDS (on any subset). 

4. Control of the FDR 

In this section, our main result is stated and then illustrated with several examples. 
4.1. Main result 

The following theorem establishes our main result on sufficient conditions to ensure FDR 
control at a specified level for step-up procedures. It is proved in Section 5. 

Theorem 4.1. Assume that the hypothesis space % satisfies (Al) and is endowed with 
a finite measure K. Letp{X) = (ph{X))heH be a p-value process satisfying the conditions 
(A2) and (A3). Denote R the step-up procedure on ("HjA) associated to a threshold 
function of the product form A{h,r) — aT:{h)/3{r) , with a G (0,1), /3 a non- decreasing 
right- continuous shape function and tt a probability density function on % with respect 
to A. Then for any P , letting n('Ho(-P)) := SheHoiP) ''^{h)dK{h), the inequality 

FDR{R,P) <an{HoiP)) (< ") (12) 
holds in either of the two following cases: 

1. /3{x) = X and the p-value process p is finite dimensional weak PRDS on 'Hq{P) for 
the distribution P; 

2. the function (3 is of the form 

Pi,{x) — / udv{u) , (13) 
Jo 

where v is an arbitrary probability distribution on (0,oo) . 

Since tt is taken as a probability density function on "H with respect to A, the FDR 
in (12) is upper bounded by an('Ho) < an('H) = a, so that the corresponding step-up 
procedure provides FDR control at level a. As an illustration, a typical choice for tt is 
the constant probability density function V/i S H,TT{h) — 1/A('H) = M^^. 
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According to the standard philosophy of (multiple) testing, while the FDR is con- 
trolled at level a as in (12), we aim to have a procedure that rejects a volume of hy- 
potheses as large as possible. In that sense, choosing a step-up procedure with /3(x) — x 
always leads to a better step- up procedure than choosing j3{x) of the form (13), because 

udv{u) < X. Hence, in Theorem 4.1, the PRDS assumption allows us to get a result 
which is less conservative (i.e. rejecting more) than under arbitrary dependencies. There- 
fore, when we want to apply Theorem 4.1, an important issue is to obtain, if possible, 
the finite dimensional PRDS condition, see the examples of Section 4.2. When the PRDS 
assumption does not hold, we refer to Blanchard and Roquain (2008) for an extended 
discussion on choices of the shape function /3 of the form (13) (which can be suitably 
adapted to the uncountable case). 



4.2. Applications 

4.2.1. FDR control for testing the mean of a Gaussian process 

Consider the multiple testing setting of Example 2.2. More specifically, we consider here 
the particular case where we observe {Xt,t g [0, 1]'*} a Gaussian process with measurable 
mean n, with unit variance and covariance function S. Recall that the problem is to test 
for all t S [0, l]'^ the hypothesis : "/J,(t) < 0". Taking for A the d-dimensional Lebesgue 
measure, the FDR control at level a of a step-up procedure of shape function /3 and weight 
function 7r(/i) = 1 can be rewritten as 



E 



A(^{te [0, ll-^ : M(i) < 0, $(At) < a/3{f{X))} ^ 



<a, (14) 



A(^{<e [Cll-i : $(Xt) <«/3(f(A))}^ 
where $ is the upper-tail distribution function of a standard Gaussian variable and 
f{X) = maxjr G [0,1] : A^{< e [0,1]'' : $(At) < a/3(r)}^ > r| . 

Thus, Theorem 4.1 and Lemma 3.5 entail the following result. 

Corollary 4.2. For any jointly measurable Gaussian process {Xt}^^^Q ^d over [0,1]'' 
with a measurable mean /i and unit variances, the FDR control (14) holds in either of 
the two following cases: 

• P{x) = X and the covariance function of the process is coordinates-wise non- 
negative, that is, satisfies (11); 

• P is of the form (13), under no assumption on the covariance function. 

For instance, any Gaussian process with continuous paths is measurable and thus can 
be used in Corollary 4.2. More generally. Lemma S-1.2 states that any Gaussian process 
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with a covariance function Yi{t,t') such that 




has a measurable modification and hence can be used in Corollary 4.2. 

4.2.2. FDR control for testing the signal in a Gaussian white noise model 

We continue Example 2.3, in which we observe the Gaussian process X defined by 
= Io9{t)f{t)dt + J^g{t)dBt, g e ^^([O,!]), where B is a Wiener process on [0, 1] 
and / S C([0, 1]) is a continuous signal function. Remember that we aim at testing Ht : 
''f{t) < 0" for any t e [0,1], using the integration of the process against a smooth- 
ing kernel Kt. Assuming condition (3) holds, the p- value process is obtained via (4) as 

\ 1/2 

Pt{X) — $ (Yt), where Yt = w^/ {Xxt — St^,jCK,t) is a Gaussian process. Applying 

Lemma 3.5, we can prove that the p- value process defined by (4) is finite dimensional 
strong PRDS (on any subset) by checking that the covariance function of (Yt)t has non- 
negative values: the latter holds because the kernel K has been taken nonnegative and 
Wt, s, Cov{Yt,Ys) — c K{{t — u)/rf)K{{s ~ u)/rf)du, for a nonnegative constant c. As a 
consequence. Theorem 4.1 shows that a step-up procedure using I3[x) — x controls the 



To illustrate this result, let us consider a simple particular case where the kernel K is 
rectangular, i.e., K{s) = l{|s| < l}/2 and / is L-Lipschitz. Also, to avoid the boundary 
effects due to the kernel smoothing, we assume that the observation X is made against 
functions of L^([— 1,2]) while the test of Ht : "/(i) < 0" has only to be performed for 
t e [0,1] only. In that case, for t e [0,1], 5t,r) = Lr], CK,t = VK,t = '7/2, so that 
Yt = {2rj)~^/'^{Zt+ri — Zt-ri — Lrf'). Therefore, the following statement holds. 

Corollary 4.3. Let us consider the Gaussian process Zt = J^^ f{s)ds + Bt, t € [—1, 2], 
where B is a Wiener process on [—1,2] and f is a L-Lipschitz function on [—1,2] (L>0). 
Let rj € (0, 1] and Yt — (27/)~^/^(Zf+,, — Zf_,, — Lif'). Denote the Lebesgue measure on 
[0, 1] by A. Gonsider the volume rejection of the step-up procedure using n(t) = 1 and 
P{x) = X, that is, 



where $ denotes the upper-tail distribution function of a standard Gaussian variable. 
Then the following FDR control holds: 



FDR. 



r{X) = max <^ r G [0, 1] : A {t G [0, 1] : <^{Yt) < ar] \ > r 





(15) 
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4-2.3. FDR control for testing the c.d.f. 

Consider the testing setting of Example 2.4 where we aim at testing whether "F{t) < 
Fo{t)" for any t in an interval J C M. Lemma A.l states that the p-value process defined 
by (5) is finite dimensional weak PRDS (on any subset). As a consequence, Theorem 4.1 
applies and leads to a control of the FDR. 

For instance, let us consider the simple case where / — [0, 1], Fo{t) = t and A is the 
Lebesgue measure on [0, 1]. In this case, for any k G {1, • . . ,m}, the function Gt{k) = 
V{Zt > k), with Zt ^ B{m, t), is continuous increasing in the variable t S [0, 1]. Moreover, 
for any t E (0, 1), the function Gt{k) is decreasing in fc = 0, . . . , to. Therefore, denoting 
= A'(o) < ATfi) < . . . < -'f(m) < X(^rn+i) = 1 the order statistics of Xi, . . . ,Xjn, the 
p- value process t i-)- pt{X) = Gt{\{l < i < to : Xi < t}\) is equal to 1 on [0,A'(i)), 
is increasing on each interval (A"(j-), X(^j^i^], j ~ 1, . . . , m, and is left-discontinuous and 
right-continuous in each Ar(j), 1 < j < to, with a left limit larger than pXf^j-^{X) = 
GxojO') (see Figure 1). 

As a consequence, for any threshold u G (0, 1), we obtain the following relation for the 
Lebesgue measure 7(u) of the level set {t e [0, 1] : PtiX) < u}: 

m 

j{u) = ^ l{Gx<,,(j-) < u} A{{t > X(,) : Gt{j) < u and t < 

where tj{u),j = 0, . . . , to is the unique solution of the equation Gt{j) = u, which can be 
easily computed numerically. Choosing for simplicity a uniform weighting ■n{x) = 1, the 
choice of the rejection threshold given by the linear step-up procedure is then u — ar, 
where r is the largest solution of the equation ^{ar) ~ r. To sum up, we have shown the 
following result: 



Corollary 4.4. Let X = (Xi, . . . , Xm) be a vector of m i.i.d. real random variables 
of common continuous c.d.f. F. Consider (pt(X))(g[o,i] p-value process Pt{X) = 
Gt{\{^ < i < m : Xi < t}\) for Gt{k) — ¥{Zt > k), where Zt is a binomial variable of 
parameters {m,t). Assume that the hypothesis space [0,1] is endowed with the Lebesgue 
measure A. Consider the volume rejection of the step-up procedure given by 

?{X) = max{r e [0, 1] : -f{ar) > r} , (17) 

where 7(-) is defined by (16). Then the following FDR control holds: 

^ l A{ {t g [0, 1] : F{t) < t, ptjX) < af{X)} ) 
A({te [0,1] : pt{X) < af{X)}) 



< a. 



(18) 
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4-2. 4- FDR control for testing the intensity of a Poisson process 

Let us consider the testing setting of Example 2.5. Lemma A. 2 states that the p-values 
process is finite dimensional strong PRDS (on any subset). Thus, it is also finite dimen- 
sional weak PRDS (on any subset) by Lemma S-2.2, and Theorem 4.1 leads to a control 
of the FDR. 

Now, we aim at finding a closed formula for the linear step-up procedure (/3(a;) = 
x) using the p-value process {pt{X))t. Let us consider the particular case where the 
benchmark intensity Ao(-) is constantly equal to some Aq > while A(-) is L-Lipschitz. 
Also, to avoid the boundary effects, assume that the process {Nt)t is observed for t G 
[—1,2] while Ht : "A(i) < Aq" is tested only for t G [0, 1]. In this case, the value process 
is simply given by 

PtiX)^G{Nt+^^Nt-^), (19) 

where for any k E N, G{k) denotes P [Z > fc] for Z a Poisson distribution of parameter 
2rj\Q + Lri^ (note that G(-) is independent of t). Consider the jumps {Tj}j of the process 
{Nt)t^[-i^2\ and the set S = {si}2<i<m of the the distinct and ordered values of the set 
Uj{T, — ?7, Tj -|- 77} n (0, 1). Moreover, we let si = and Sm+i = 1- Next, since the p-value 
process is constant on each interval [si, Si+i), 1 < i < to, we have for any u > 0, 

m 

A {{t £ [0, 1] : pt{X) < u}) = ^(s.+i - s,)l{ps,{X) < u} 

i=\ 
m 

= ^ Wkl{qa(k) (X) < u}, 

fc=l 

where we let qi{X) — ps.{X), where cr is a permutation of {1, . . . , to} such that qa-{i) ^ 
■ • • ^ 9o-(m) and where Wk = Sc(k)+i — s^^k) > can be interpreted as a "weighting" 
associated to qa{k)- As a consequence, we get 

f(X)= max jre [0,1] : f^wiliq.^i^iX) < ar} > r'^ 

{k '^1 
^ w;, for fc e {0, . . . ,to} s.t. qcr{k){X) <a'^wi \ , 
l=Q 1=0 J 



(20) 



because since r{X) is a maximum, it is of the form X]f=o ^ ^ {Oj • ■ • i ™}- Note that 
we should let go-(o) = and wo = to cover the case r{X) — 0. Relation (20) only 
involves a finite number of variables. Thus, r{X) can be easily computed in practice. 
This is illustrated in Figure 2. 

We have proved the following result: 

Corollary 4.5. Let X = {Nt)f^[_i 2] be a Poisson process with an intensity X : [—1,2] — )■ 
M+ L-Lipschitz (L > 0) and let Aq > 0. For rj G (0,1], consider the p-value process 
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{pt(X)}jgjo,i] gi'ven by (19). Assume that the hypothesis space [0, 1] is endowed with the 
Lehesgue measure A. Then r{X) defined by (20) satisfies the following: 



E 



A( {t g [0, 1] : \{t) < Aq, ptjX) < a f{X)} ) 
A({te [0,1] : pt{X)<af{X)}) 



< a. 



(21) 



To illustrate Corollary 4.5, Figure 2 displays the case where X{t) is a truncated trian- 
gular signal. The choice of the bandwidth 77 has been made manually, see Section 6.2 for 
a discussion on this point. 

Remark 4.6. Up to increase the set S — {si}i so that t 1-^ l{A(i) < Aq} is constant 
over each [s,, s,;_|_i), the FDR control (21) can be rewritten as 

"EIli(s»+i -s.)l{A(s.) < Ao}l{p..(X) <afiX)}- 



E 



< a. 



(22) 



Hence, the procedure (20) appears as controlling the discrete FDR- weighting on {1, ... , to} 
where the weight for rejecting "A(si) < Aq" is (si+i — Si) and where the initial p-values 
are qi{X) — ps.{X). The rationale behind this is that if qi{X) ~ ps-(X) is below r{X), 
then so are all pt{X), t G [si,Si+i). Hence a rejection for a p- value qi{X) = ps-{X) ac- 
counts for the length of the entire interval in the FDR. From an intuitive point of view, 
this means that the type I error importance in the FDR is larger for "isolated" points of 
the process. 

Finally, let us underline that (discrete) weighted FDR control results of the type (22) 
have been found in Benjaniini and Hochberg (1997) and Blanchard and Roquain (2008), 
but only for a weighting that does not dependent on the data. Here, since {si}i depends 
on X, these former results do not apply and Corollary 4.5 is a novel finding. 



5. Proof of Theorem 4.1 

5.1. Two conditions for controlling the FDR 

Similarly to Proposition 2.7 of Blancliaid and l(,o(iuani (2U0(S) (which we refer to as BROS 
for short from now on), we can prove that the FDR control FDR(i?, P) < aIl{Ho{P)) 
holds true for any P ^ V as soon as the two following sufficient conditions hold for any 
PeV: 

• the multiple testing procedure R satisfies the "self-consistency condition" 

R{x) c{hen : Ph{x) < aTT{h)f3{A{R{x)))} for P-almost a\\ x e X 

(SC(a,^,/3)) 

• for any h G Ho{P) the couple of real random variables {Uh, V) :— {ph{X), A{R{X))) 
satisfies the "dependence control condition" 



Vc> 0, E 



V 



1{V > 0} 



< c. 



(DC(/3)) 
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Figure 2. Several plots versus t £ [0,1]. Top left: A(t) (solid) and Aq (dashed). Top right: q^^^-^{X) 

and a X]f=o function of X]f=o ' k = 1, . . . ,m. Bottom: p- value process pt {X) defined by (19). 

Tj = 0.015, a = 0.4. The grey areas indicate regions where the null hypotheses are true. 
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The proof is as follows: by definition and by using Fubini's theorem, we have 

"A(i?n-Ho). 



FDR(i?,P) =E 

= E 



-l{A(i?) > 0} 



A(i?) 

/ ^^w';f^ l{A(fl) > 0}dA{h) 



ikeno MR) 
'l{h e R} 



E 



he-Ho 



l{A(i?) > 0} 



< 



E 



he-Ho 



MR) 

l{ph < aTT{h)l3{A{R))} 



MR) 



dA{h) 

l{A(i?) > 0} 



dA{h) 



< a 



TT{h)dA{h), 



he-Ho 



where we have used the shortened notation R for R{X) and ph for ph{X), and used 
successively conditions (SC(q;, tt, /?)) and(DC(/5)) for the two above inequalities. Ob- 
serve that the use of Fubini's theorem is granted by the measurability assumption of 
Definition 2.1. 

Therefore, to obtain the FDR bound of Theorem 4.1 in each case, we simply have to 
check conditions (SC(a, tt, /?)) and (DC(/3)) in the different settings. 



5.2. Any step-up procedure satisfies (SC(q;, tt, 

From the definition of a step-up procedure, for all e > 0, we have A{L^{r)) < A(LA{r + 
e)) < r + e. This entails that r satisfies A(La(?^)) = r. Hence the step-up procedure R 
satisfies SC(a,7r,/3) with equality. 

We now check that any step- up procedure is a multiple testing procedure, that is, that 
l{h€ RiX{uj))} = l{phiX{uj)) < aTT{h)(3i?{X{uj)))} is (jointly) measurable. 
From (A2) and since f3 and tt are measurable, it is enough to check that oj i— ?> r{X{uj)) 
is measurable. For any x E X{il.), let us consider the function 

/ : r e M+ A{La{x, r)) = / l{ph{x) < aTr{h)P{r)}dA{h). 

J-H 

We observe that / is right-continuous and nondecreasing (because (3 is) and bounded, 
and that r — max{r > : /(r) > r} . Applying Lemma S-2.3, we deduce that for any 

xex{n), 

r(x) = inf sup|reQ+ : A(LA(x,r))>r — e} 

E>0,eeQ ' 

inf sup ( rl{A(LA(a;,r)) > r-e} y (23) 

e>0,eGQrgQ+ \ / 

Since from (A2), for all £ > 0, e e Q and r e Q+, the function 

w ^rl{A{LAiX{Lo),r)) > r - e} = rl{A{{h G U : Ph{X{uj)) < a7r(/i)/3(r)}) > r - e} 
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is measurable, expression (23) implies that cj i— > r(X(a;)) is measurable. Hence, a step-up 
procedure satisfies the measurability requirements of Definition 2.1. 

5.3. Conditions implying (DC(/3)) 

We use the following lemma which was proved in (BROS) (see Lemma 3.2, items (ii,iii) 
therein) : 

Lemma 5.1. Let {U,V) he a couple of nonnegative random variables such that U is 
stochastically lower bounded by a uniform variable on [0, 1] , i.e. Vt e [0, 1], V{U < t) < t . 
Then the dependence control condition DC(/3) is satisfied by {U,V) in either one of the 
following situations: 

(i) j3{x) = X and 

Vr e M+, ¥{y <r\U <u) is nondecreasing on {u : P{U < u) > 0} . (24) 

(ii) The shape function /3 is of the form (13). 

The point (ii) above, together with the results of the two previous sections, establishes 
point 2 of Theorem 4.1. To establish point 1 and finish the proof, we have to prove that 
(24) holds in the finite dimensional weak PRDS dependence context, which is done in 
the following proposition: 

Proposition 5.2. Assume that the p-values process p = {ph,h G H) is finite dimen- 
sional weak PRDS on HqIP) for any P G V. Consider R the step-up procedure defined 
by Definition 3.2 with /3{x) = x. Then for any P € V , for any h € 'Ho(-P), the couple of 
variables {Uh, V) = (p/i, A(i?)) satisfies (24) and thus DC(/3) holds for P{x) = x . 

Proof. In the above statement and the present proof, we use the shortened notation 
R,Ph, and LAir) for the random quantities R{X),ph{X), and LA{X,r), respectively. 
The goal of the proof is to establish (24), that is for any Hq G Hq {Hq is assumed to be 
fixed in Ho in the rest of the proof), for any t, and < u < u' with F{pho < u) > 0: 

P [A{R) <t\ph„<u]<¥ [A(i?) <t\pi,„< u'] ; 

From Definition 3.2, the real random variable A(i?) can be rewritten as A(i?) = r = 
max{r : f{r) > r} with / : r i-> A{LA{r)). Furthermore, denoting Gu — ^ we 

are equivalently aiming at proving that for any t and < u < u' with V{pho ^ "") > : 

E [l{f < t}Gu] < E [l{f < t}Gu'] . (25) 

By using Lemma 5.3 (and the notation therein) there exists a fixed sequence of finitely 
supported measures A„ on T-L such that, denoting rvi ^ = max{r > : A„(LA(f')) > 
r — A;~^}, it holds that 

r— lim r^ = lim r^ almost surely, (26) 
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f+ = liinsup„^oof„,fc and = liminf„_^oo 
Let Sn be the (finite) support of A„ and 5^ = 5„ U {ft-o}- Writing r„_fe as a function of 
the finite p-value set {ph,h € 5^}, the function f„^fc : z = {zh)heS'^ G [0, 1]"^" 7>i,fe(z) 
is measurable (where the space [0, 1]"^" is endowed with the standard product Borel a- 
field), and is additionally non-increasing in each p- value. Hence the set {z = {zh)hes;^ '■ 
r„^fc(z) < t + k~^} is a non-decreasing measurable subset of [0,1]'^". Using that the 
p- value process p = {Ph,h G H) is finite dimensional weak PRDS on Hq, the p- values 
{ph, h E S^) are PRDS on Hq D which implies that for any t > and u < u' with 
''ho < u) > , 



E [l{f„^fe - k-' < t}Gu] < E [l{f„,fe - fc-i < t}Gu'] 



(27) 



Now, to prove (25), it suffices to carefully make n and k tend to infinity. By Fatou's 
lemma and by (27), we have for all A: > 1 : 



E 



lim inf l{r„ 



k-^ < t}Gu < liminf E [l{f„.fe - k^^ < tjCJ 

J n 

< lim sup E [l{f„^fe - k^^ < t}Gu'] 



< E 



lim sup l{rn,k - k ^ < <}G„' 



Notice that the following inclusions of events hold: {r^ < t + k ^} C liminf„{r„ji; < 
t + k~^}, limsup„{f„^i, < t + k~^} C {r^ <t + k~^}. Hence, we obtain for all k: 

E [l{f+ - fc-i < i}Gj < E [l{f^ - fc-i < t}Gu'] . 

Then, if t is such that P [r = =0, the above expression can be rewritten as 

E [l{f+ - fc-i < t}Gul{r^t}] < E - fc-i < t}Gu'l{r^t}] . 

We now let fc — ?> cxD in the above expression by using (26) and the dominated convergence 
theorem: for any u < u' with V{pho < u) > 0, and any t ^ D := {s > Q : P [r = s] > 0}, 
we have 



E [l{f < t}Gu] < E [l{f < t}Gi, 



(28) 



Since the above expectations may be interpreted as (conditional) probabilities, the LHS 
and RHS in (28) are left-continuous functions of t. Using that n Z?"^ is dense in 
K"*" (because D is at most countable), we obtain that (28) holds for any t. Finally, the 
condition (DC(/?)) comes from Lemma 5.1. □ 



5.4. Finite approximation of step-up procedures 

As usual, to lighten notation R,ph, L j\{r) ,r denote the random quantities R{X),ph{X), L\{X, r)^r{X). 
The following result shows how to derive the continuous step-up procedure (see Defini- 
tion 3.2) from a limit of finite step-up procedures. It is used in the proof of Proposition 5.2. 
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Lemma 5.3. Consider the step-up procedure R ~ Lj^ir) on % using A and with r 
defined in Definition 3.2. Then there exists a sequence of finitely supported measures A„ 
on % such that, denoting 

rn,k = max{r > : A„(iA(f))) >r- fc"^}, 

we have 

r= lim limsupf„^fc I = lim (liminfr„^fc) almost surely. 

Proof. We start with the following observation. Consider (A„) some sequence of mea- 
sures on H such that A„(H) = M. For a fixed realization x £ X{n) of X, we consider 
/ : r e M+ A(LA(a;,r)) and /a„ : r e M+ A„(LA(a;, r))). Clearly, / and /a„ 
are nondecreasing right-continuous functions. Using Lemma S-2.4, we conclude that the 
desired result holds provided that, for P-almost all x £ X, /a^^ converges uniformly to / 
over [0, M-|- 1]. It remains thus to prove that there exists a sequence of finitely supported 
measures A„ on H such that for P-almost all x £ X, 



limsup<^ sup |A„(LA(a;,r))-A(iA(a;,r))| ^ =0. (29) 

Denote y the product space H^, endowed with the product sigma- algebra. For y := 
{hi)i>i £ y some sequence of hypotheses, denote Ajf' = ^'^''^^^ Y^l^^i^K the suitably 
scaled uniform atomic measure on (hi, . . . ,hn). 

Consider now Y :— (i?i)i>i £ y an i.i.d. sequence of hypotheses drawn independently 
of X according to the probability distribution A/M on H. Observe that for any fixed 
x £ X{n), L^{x,r) = {h £% : Ph{x) < aTr{h)l3{r)} ^ {h £ U : q{h,x) < a/3(r)}, 
where 

(ph{x)/iT{h) if7r(/i)>0; 
q{h, x) <0 if 7r{h) — and Phix) = 0; 

[a/3(M + l) + l if Tr{h) ^ and Phix) > . 
Thus, applying the Glivenko-Cantelli theorem to the i.i.d. variables {q{Hi, x))i, we deduce 
that for any a; £ Cix,y) limsup„^^ sup,.g[Q j^^^^] Alf'(LA(a;,r)) - A{LA{x,r)) = 

for Py-almost all realizations y of Y . Observe furthermore that for any fixed r, the 
function 

n 

iu,y) e f7 X A^y\LA{X{uj),r)) = Afn"! ^ (X(t^)) < aTT{h,)l3{r)} 

is a (jointly) measurable function of (w,?;) by assumption (A2). The inside supremum 
in (29) can be restricted to rational numbers since the functions involved are right- 
continuous. Therefore, (w,?/) i— > C(^('^):y) is a jointly measurable function in its vari- 
ables. By Fubini's theorem, this implies that Ex,y [Ci^^ ^)] — 0; ^^d thus also, for 
Py-almost all y £ y, Ci^iU) = foi' P-almost all x £ X. Since an event of probability 

1 is non-empty, there exists a fixed y £ y such that C{x,y) = for P-almost all x £ X, 
which gives rise to a sequence of finitely supported measures A„ satisfying (29). □ 
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6. Discussion 

6.1. FDR control for self-consistent, non step-up procedures 

In some cases, for instance, after a discretization in r or under a global constraint over 
the admissible geometry of sets of rejected hypotheses, the procedure of interest may not 
be of the step-up form, while still satisfying the more general condition (SC(q;, tt, /?)) 
(called self-consistency, see Section 5.1). In that situation. Theorem 4.1 does not apply, 
because the procedure is not step-up. We proved an extension of Theorem 4.1 holding 
more generally for (nonincreasing) self-consistent procedures, but point 1 of the theorem 
is established only under a stronger PRDS condition called general PRDS. (On the 
other hand, the fact that point 2 of Theorem 4.1 remains valid under the more general 
condition (SC(a, tt, /3)) is quite immediate.) The general PRDS condition is defined in 
terms of the entire process X and not only its finite dimensional projections. Therefore, 
it is substantially more technical than finite dimensional PRDS. In particular, it is an 
open question to characterize when does finite dimensional PRDS imply general PRDS 
(we provide some sufficient conditions). For simplicity, we deferred the corresponding 
study in part II of the supplementary material (Blanchard ct al., 2011). 

6.2. Power and adaptive procedures 

This work has focused on procedures ensuring control of the type I error as measured by 
the FDR. Under this constraint, one would like to maximize power. We do not address 
this issue in the present work; a specific multiple testing power criterion would have to be 
defined to begin with, for instance the average number of correct rejections. We briefly 
discuss possible future directions in this regard, in particular adaptivity properties with 
respect to different types of underlying regularity structure. 

Adaptivity of single tests 

The power of a multiple testing procedure depends primarily on the power of the 
underlying single tests and p-values it is built upon. It is of course desirable to design 
individual tests that are as powerful as possible in the first place. While this issue actually 
pertains to the domain of single hypothesis testing, and is to this extent quite independent 
of the methodology studied here, we briefly discuss this issue in the light of the specific 
example of the Gaussian white noise model dZt = f{t)dt + adBt- For designing a test of 
the hypothesis /(^o) = 0, we have assumed known regularity of / and considered a test 
based on a simple kernel estimator. Could this be improved? 

There is an abundance of literature on adaptive testing of a global qualitative hy- 
pothesis on / (the simplest example being testing that / is identically zero), where 
adaptation is understood with respect to the (Holder or Besov) regularity of the alter- 
native and separation from the null is generally measured in some norm. This might 
give some hope that some form of regularity adaptation is possible also for testing the 
local hypothesis /(io) — (and the separation distance |/(io)|), but the situation is 
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in fact quite different and possibilities for tliis are severely limited. This is in essence 
the same phenomenon as for the existence of regularity-adaptive confidence intervals for 
pointwise estimation of a function, as studied by Cai and Low (2004). We sketch the 
main arguments here. First, following the discussion in Diimbgen and Spokoiny (2001), 
Section 2, observe that for testing of /o against g^, the power of the optimal NP test is 
$((T ll/o — (7o|l2)i where <I> is a nondecreasing function. Thus, the optimal power of a com- 
posite null Hq against an alternative Hi is upper bounded (with equality if Hq and Hi 
are convex) by <^{amif^Ua,g(iHi 11/ - siy - 1^ ^^'^ case where Hq = {f & 7", f{to) = 0} 
and Hi = {f <E J^i,f{to) = e}, where J^i C J- are Holder regularity classes, Cai and 
Low (2004) (Example 1 there) establish that the rate behavior as e — > of this infimum 
distance is determined by T and not by Ti. Therefore, no adaptation to the regularity 
of the alternative is possible in this configuration, and it is necessary to assume some a 
priori known regularity class T. On the other hand, these authors show that adaptive 
confidence intervals (and hence tests) exist in this setting provided some additional shape 
restrictions, such as monotonicity, are assumed to hold. 

Adaptivity to n('Ho) and to the dependence structure 

For multiple testing over a finite hypothesis space, recent research has focused on 
improving step- up procedures to take into account, on the one hand, the (unknown) 
volume 11(7^0 (-P)) of true null hypotheses - which comes as a nuisance parameter reducing 
the effective level , see (12), and on the other hand, the dependence structure of the p- 
values. Both directions suggest further possible developments in the continuous setting 
as well. 

Appendix A: PRDS statements 

Lemma A.l. The p-value process p(^) — {pt{X)^t G /} defined by (5) is finite di- 
mensional weak PRDS (on any subset). 

Proof. Let us consider a finite subset (tj)o<cj<N-i of / and D a non-decreasing mea- 
surable subset of [0, 1]^. Let us prove that the function u i-^ P [p{X) e D \ pto{X) < u] 
is nondecreasing on {u G [0,1] : F{pto{X) < u) > 0}. If F(io) e {0,1}, the result is 
trivial. We thus assume that F{tQ) S (0, 1), so that Ut^ — {Gt„{k),k = to, m — 1, . . . , 0} 
contains only increasing points of (0, 1]. Without loss of generality, we only have to prove 
the nondecreasing property for u G l^ta- Since Gt^ is decreasing from {0, . . . ,to} to Uiq, 
we have pto{X) < Gtgik) <=^ mF„(X,to) > k -^=^ X/^k) < k (letting X(o) = -oo). We 
thus have to prove that for any fc, 1 < fc < m, 

P [(X(i), . . . ,X(„)) e D' I < to] > P [(^(1), . . . ,^m) e D' I < to] , (30) 

where Z?' = {a; e M™ : {ptj{x))o<j<N-i G D} is a nondecreasing subset of M™ (because 
p is coordinate wise nondecreasing, i.e., x < x' ^ yt,pt{x) < pt{x')). Using that the 
family of order statistics has positive regression dependency (see Lemma S-2.1), 
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we derive that the function /(a, 6) = E . . . , X(„j)) G D' \ = a, = 6] is 

nondecreasing in a and h. Therefore, denoting 7 = P [-'^(fe) < I ^(fe-i) 5; ^0] ^ 'wc get 

P , . . . , e I X(k-i) < to] = 7E I X(fe_i) < io, ^(fc) < io] 

+ (1 - 7)E [f{X^k-i),X(u)) I ^(fe-i) < <o < X(k)] 
> E [/(-'f(fe-i),-'^(fe)) I -'^^(fc-i) < to,X(^k) < ^o] , 

which provides (30) and concludes the proof. □ 

Lemma A. 2. The p-value process p(^) = {pt{X),t G [0,1]} defined by (6) is finite 
dimensional strong PRDS (on any subset). 

Proof. Let Mt = A^(t+„)Ai - ^(t-.,)vo for any t G [0,1]. Fix {tj)o<j<q-i G [0,1]« 
and assume tg G [77, 1 — 77] (the other case can be proved similarly). Take a nonde- 
creasing measurable set D C [0,1]'' and consider the set D' = {{Mt-)o<j<q-i G : 
(Gt^. (Mi^ ))o<j<g-i G D}, which is nonincreasing on N'? and measurable. We thus aim to 
prove that for any n > 0, 

P [{Mt^ )o<j<5-i eD'\Mt^,=n+l]<V [{Mt^ )o<j<q-i € D' \ Mt„ = n] . (31) 

Denote by Xi < . . . < , Yi < . . . < Yf^^ and Zi < . . . < Z^^ the jump times of the 
process (A^t)tg[o,i] within the (disjoint) subsets [0,to — rj), [tg — rj^tg + rj] and {to + 77, 1], 
respectively. Remark that fcy = Mt^ with our notation. Since (Aft)tg[o,i] is a Poisson 
process, the family {{Xi,l < i < kx , kx), {Yi,l < i < ky , ky), {Zi,l < i < kz,kz)}, 
contains mutually independent elements. Furthermore, the distribution of (Yi, . . . , Ifc^ ) 
conditionally on ky ^ n is equal to the distribution of the order statistics of a sample 
(F/, . . . , Y^) of i.i.d. random variables with common density t H> X{t)/ J[^^_^ to+v] M^)^^ 
on [tn — rj, to+rj] (w.r.t. the Lebesgue measure). Next, denoting It — [(t — 77)V0, (t+rj) Al], 
for any t G [0, 1], we can write: 

¥[iMt.)o<j<g-ieD'\Mt„=n+l] 

' kx n+1 kz \ 

^ i{x, + e + E ^ ^*.} e ^' 

Vi=l 1=1 i=l / o<j<<;-l 

^ fcx fcz \ 

^ G J + ^ e } + E e It J g i?' - (i{K+i g }), 

\,i=l i=l i=l / 0<j<g-l 

^ fcx " kz \ 

\i=l i=l i=l / 0<j<g-l 

= V [{Mt^)o<j<g-i € D' \ Mt„ ^ n] , 

the inequality coming from D' — (l{K^^i G Itj})j C D', because D' is nonincreasing. 
This proves (31) and concludes the proof. □ 
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